Comparison of Filter and Wrapper Based Feature Selection Methods on Spam Comment Classification
نویسندگان
چکیده
The continuous growth of the internet has led to use social media for various purposes increase. For instance, some irresponsible parties take advantage comment feature on platforms harm others by providing spam comments shared object. Furthermore, variation creates many features be processed, thereby negatively impacting performance a classification algorithm. Therefore, this study aims solve problem associated with comparing filter and wrapper based selection using text techniques. Data collected from training test data 4944 100 showed that best accuracy, precision, recall, f-measure MNB are 96%, 100%, 92%, 95.8%. accuracy is achieved combining Chi-Square Sequential Forward Selection methods subset 500 features. increase in SVM classifications 8% 4%. This research concludes combination improves Indonesian language comments.
منابع مشابه
An evaluation of filter and wrapper methods for feature selection in classification
Data mining is a form of knowledge discovery required for solving problems in a specific domain. Classification is a technique used for discovering class labels of unknown data. Different methods for classification exists like bayesian, decision trees, rule based, neural networks etc. Before applying any mining technique, irrelevant and redundant features needs to be removed. Filtering is done ...
متن کاملDeveloping a Filter-Wrapper Feature Selection Method and its Application in Dimension Reduction of Gen Expression
Nowadays, increasing the volume of data and the number of attributes in the dataset has reduced the accuracy of the learning algorithm and the computational complexity. A dimensionality reduction method is a feature selection method, which is done through filtering and wrapping. The wrapper methods are more accurate than filter ones but perform faster and have a less computational burden. With ...
متن کاملon the comparison of keyword and semantic-context methods of learning new vocabulary meaning
the rationale behind the present study is that particular learning strategies produce more effective results when applied together. the present study tried to investigate the efficiency of the semantic-context strategy alone with a technique called, keyword method. to clarify the point, the current study seeked to find answer to the following question: are the keyword and semantic-context metho...
15 صفحه اولFeature Selection and Classification { a Probabilistic Wrapper
Feature selection is deened as a problem to nd a minimum set of M features for an inductive algorithm to achieve the highest predictive accuracy from the data described by the original N features where M N. A probabilistic wrapper model is proposed as another method besides the exhaustive search and the heuristic approach. The aim of this model is to avoid local minima and exhaustive search. Th...
متن کاملFilter- versus wrapper-based feature selection for credit scoring
We address the problem of credit scoring as a classification and feature subset selection problem. Based on the current framework of sophisticated feature selection methods, we identify features that contain the most relevant information to distinguish good loan payers from bad loan payers. The feature selection methods are validated on several real world datasets with different types of classi...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IJCCS
سال: 2021
ISSN: ['2460-7258', '1978-1520']
DOI: https://doi.org/10.22146/ijccs.66965